Xue Yang

Interleave-VLA: Enhancing Robot Manipulation with Interleaved Image-Text Instructions

May 04, 2025
Via arXiv

A Unified Agentic Framework for Evaluating Conditional Image Generation

Apr 09, 2025
Via arXiv

Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation

Apr 09, 2025
Via arXiv

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Apr 03, 2025
Via arXiv

SA-Occ: Satellite-Assisted 3D Occupancy Prediction in Real World

Mar 20, 2025
Via arXiv

When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning

Mar 10, 2025
Via arXiv

Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models?

Mar 10, 2025
Via arXiv

GenieBlue: Integrating both Linguistic and Multimodal Capabilities for Large Language Models on Mobile Devices

Mar 08, 2025
Via arXiv

BadRefSR: Backdoor Attacks Against Reference-based Image Super Resolution

Feb 28, 2025
Via arXiv

MKE-Coder: Multi-Axial Knowledge with Evidence Verification in ICD Coding for Chinese EMRs

Feb 19, 2025
Via arXiv